A Hybrid Multi-group Privacy-Preserving Approach for Building Decision Trees

نویسندگان

  • Zhouxuan Teng
  • Wenliang Du
چکیده

In this paper, we study the privacy-preserving decision tree building problem on vertically partitioned data. We made two contributions. First, we propose a novel hybrid approach, which takes advantage of the strength of the two existing approaches, randomization and the secure multi-party computation (SMC), to balance the accuracy and efficiency constraints. Compared to these two existing approaches, our proposed approach can achieve much better accuracy than randomization approach and much reduced computation cost than SMC approach. We also propose a multi-group scheme that makes it flexible for data miners to control the balance between data mining accuracy and privacy. We partition attributes into groups, and develop a scheme to conduct groupbased randomization to achieve better data mining accuracy. We have implemented and evaluated the proposed schemes for the ID3 decision tree algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A hybrid intuitionistic fuzzy multi-criteria group decision making approach for supplier selection

Due to the increasing competition of globalization, selection of the most appropriate supplier is one of the key factors for asupply chain management’s success. Due to conflicting evaluations and insufficient information about the criteria, Intuitionisticfuzzy sets (IFSs) considered as animpressive tool and utilized to specify the relative importance of the criteria. The aim of this paper is to...

متن کامل

Decision Tree Classifier for Privacy Preservation

In recent year’s privacy preservation in data mining has become an important issue. A new class of data mining method called privacy preserving data mining algorithm has been developed. The aim of these algorithms is to protect the sensitive information in data while extracting knowledge from large amount of data. We focus the general classification in a secured manner and introduce a privacy-p...

متن کامل

Building Privacy-preserving C4.5 Decision Tree Classifier on Multi- Parties

In this paper, we address Privacy-preserving classification problem in a multi-party sense. We focus the general classification in a secured manner and introduce a Privacy-preserving decision tree classifier using C4.5 algorithm without involving third party. C4.5 algorithm is a software extension of the basic ID3 algorithm designed by Quinlan. Our protocol is considerably more efficient than a...

متن کامل

Privacy Preserving Data Mining using Random Decision Tree

Data processing with information privacy and information utility has been emerged to manage distributed information expeditiously. In this paper, to deal with this advancement in privacy protective data processing technology victimization intensify approach of Random Decision Tree (RDT). Random Decision Tree provides higher potency and information privacy than Privacy secured Data mining Techni...

متن کامل

A centralized privacy-preserving framework for online social networks

There are some critical privacy concerns in the current online social networks (OSNs). Users' information is disclosed to different entities that they were not supposed to access. Furthermore, the notion of friendship is inadequate in OSNs since the degree of social relationships between users dynamically changes over the time. Additionally, users may define similar privacy settings for their f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007